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The Goddard Earth Sciences Data and Information Services Center (GES DISC) at NASA has over the years developed and honed several 
reusable architectural components for supporting large-scale data centers with a large customer base. These include a processing system (S4PM) 
and an archive system (S4PA) based upon a workflow engine called the Simple, Scalable, Script-based Science Processor (S4P); and an online 
data visualization and analysis system (Giovanni). These subsystems are currently reused internally in a variety of combinations to implement 
customized data management on behalf of instrument science teams and other science investigators. Some of these subsystems (S4P and 
S4PM) have also been reused by other data centers for operational science processing. 
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3 developed and honed several Our experience has been that development and utilization of robust, interoperable, and reusable software systems can actually 

ide a processing system (S4PM) flourish in environments defined by heterogeneous commodity hardware systems, the emphasis on value-added customer service, 

Processor (S4P); and an online and the continual goal for achieving higher cost efficiencies. The repeated internal reuse that is fostered by such an environment 

:y of combinations to implement encourages and even forces changes to the software that make it more reusable and adaptable. Allowing and even encouraging 

of these subsystems (S4P and suc h selective pressures to software development has been a key factor in the success of S4P and S4PM, which are now 
available to the open source community under the NASA Open Source Agreement. 


The Simple, Scalable, Script-based, Science Processor for 
Measurements (S4PM) was a processing system conoeived 
and developed in 2001 to support the processing of data 
from MODIS 1 in the GES DISC. Built upon the proven S4P 2 
core, S4PM developers quickly developed a capable 
processing system optimized to support MODIS algorithms 
and meet the reprocessing schedule. In 2002, S4PM was 
enhanced to process data from the Atmospheric Infrared 
Sounder. The influx of new, instrument-unique 
requirements forced an S4PM refactoring toward a more 
modular architecture, particularly at the interfaces where 
algorithms are plugged in. A subsequent requirement to 
support multiple platforms (Sun, SGI, Linux) provided the 
impetus to enhanoe portability. And the need for S4PM to 
be installed, deployed, and configured quickly by multiple 
users resulted in better packaging and more automated 
configuration. This led to public release of S4PM under the 
NASA Open Source Agreement in 2004. S4PM was then 
reused by Langley Research Center to process data from 
CALIPSO 3 , MlSR 4 , and FLASHFlux 5 , and by EROS Data 
Center to process ASTER 6 data. 


The Simple, Scalable, Script-based, Science Product 
Archive (S4PA) was designed to handle the migration of 
legacy data from robotic tape archives to disk. Beginning 
in 2006, S4PA was tapped to replaoe the massive 
EOSDIS 1 Core System (ECS) data archive with a hard 
deadline at the end of 2007. The schedule pressure to 
stand up multiple disk-based instanoes of S4PA to 
accommodate the vast data volume (-400 TB), each with 
its own integration engineer, drove the development of 
more sophisticated packaging and easier, more 
automated installation mechanisms. 


Giovanni 1 is an online data visualization and analysis 
system that started out in 2002 as a Web-based CGI 2 
application. It provided an interactive session via HTTP 
through which users could select data parameters, regions 
of interest and time ranges and generate a variety of plots 
and images. Data supporting Giovanni were on the 
machine hosting it and could be downloaded by the user 
via FTP. As Giovanni grew in popularity, science users 
provided feedback, which drove enhancements which, in 
turn, expanded Giovanni’s popularity. Scientists seeking 
collaborations and interoperability with their systems 
followed. In 2005, Giovanni was then refactored from a 
CGI-based application to a services oriented architecture 
supported by a asynchronous workflow management 
system. Data no longer had to be on the host machine 
and could access remote data using an expanding list of 
protocols as well as multiple output protocols and data 
formats. 
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The Earth Science Data Systems Reuse Working Group has developed a DRAFT description of Reuse Readiness Levels (RRL), a metric for evaluating the reuse readiness of software. The top table 
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